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FOREWORD 

This  report  was  prepared  by  the  Department  of  Electrical 
Engineering  of  the  University  of  Southern  California  under  USAF 
Contract  No.  F  04{695)-67-C-0109.  This  contract  was  initiated 
under  Project  No,  5?,18  University  Program,  Task  No.  10,  "Aero¬ 
space  Vehicle  Detection  and  Tracking  Systems".  The  work  was 
administered  under  the  direction  of  Space  Systems  Division.  Air 
Force  Systems  Command  with  Li.  Amoroso  acting  as  project 
officer  and  technical  support  furnished  through  the  TDPS  Office  of 
Aerospace  Corporation, 

Information  in  this  report  is  embargoes  under  the  U.  S. 
Export  Control  Act  of  1949,  administered  by  the  Department  of 
Commerce.  This  report  may  be  released  by  departments  or 
agencies  of  the  U.S,  Government  to  departments  or  agencies  of 
foreign  governments  with  which  -the  United  States,  has  defense 
treaty  commitments.  Private  individuals  or  firms  mti3t  comply 
with  Department  of  Commerce  export  control  regulations. 

Publication  of  this  report  does  net  constitute  'Air, Force 
approval  of  the  report’s  findings  or  conclusions.  It  is  published 
only  for  the  exchange  and  stimulation  of  ideas. 


ABSTRACT 


The  subject  of  this  report  is  the  derivation  of  optimum  control 
rule  for  guidance  of  a  space  vehicle  which  seeks  to  reach  a  nioving 
target.  The  available  thrust  vector  is  assumed  to  be  bounded  in  magni¬ 
tude  and  any  on-board  measurements  to  be  contaminated  by  random 
noise.  The  performance  measure  is  chosen  to  be  the  expectation  of 

terminal  miss  distance.  The  optimum  control  rule  is  shown  to  be  a  function  of  the 

< 

sign  of  the  expected  value  of  ).ine-o£-sight  angular  rotation  with  respect 
to  an  inertial  coordinate  system  and,  consequently,  measurement  of  this 
angular  rate  (e.g.  ,  by  means  of  an  angle  tracking  system)  is  sufficient 
to  implement  the  optimum  control  rule.  The  optimum  C;pntroller  is  very 
simple  and  more  suited  to  space  application  than  the  conventional  method 
of  proportional  navigation  commonly  used  m  missile  systems  operating 
in  lower  altitudes  due  to  the  desirability  of  using  reaction  jets  in  space 


vehicles. 
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1.  INTRODUCTION 


-indy  Mi  concerned  with  techniques  lor  control  of  an  interceptor 
which  ow":k'j  to  react,  a  moving  target  in  space.  When  the  control  of  the 
interceptor  is  baaed  upon  data  that  is  contaminated  with  noise,  and  when 
both  the  interceptor  and  the  target  are  assumed  to  be  point  masses,  the 
probability  of  interception  will,  in  general,  be  zero  and.  the  distance  of 
cl--,  cat  approach  will  be  a  random  variable.  Tn  such  a  circumstance  we 
may  use.  the  expected  value  of. the  miss  distance  as  a  performance  measure.' 

If  the  target  trajectory  is  available  to  the  interceptor  either  through 

.  "  .*  ■  •'./  -  . 

a  priori  knowledge  or  by  some  on-board  measurements,  the  interception 
problem  may  be  solved  by  determining  the  time  and  point  cf  closest  approa.ch 
and  choosing  a  -ont roller  which  satisfies  terminal  condition.  The  control 
nay  be  of  an  open-loop  nature.  When  the  interceptor  dynamics  are  governed 
by  linear  differential  equations  with  bounded  control  the  solution  to  the 
problem  if 3  discussed  in  I  )  J  and  [2l.  However,  in  practice  the  amount 
of  information  concerning  target  motion  is  inadequate  to  enable  the  con  ¬ 
troller  to  generate  a  reasonable  estimate  of  target  trajectory  since,  at 
any  given  time,  the  future  target  acceleration  is  unknown.  Furthermore, 
it  may  not  be  desirable  or  practical  to  measure  the  interceptor-target 
range  accurately.  Consequently,  it  is  clear  that  a  closed  loop  solution 
to  the  problem  should  be  sought.  This  is  an  old  problem  and  has  been 
solved  in  practice  by  means  of  choosing  a  reasonable  form  for  the  inter¬ 
ceptor  control  law  and  then  minimizing  the  expected  miss  distance  through 

pr-nnonn^O  FRQM 
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simulation  by  adjusting  various  parameters.  A  very  ccmmon  scheme  is 
•cicT  d  to  as  proportional  navigation  in  which  the  interceptor  attempts 
l.o  reduce  the  rotation  of  the  line-of-sight  to  zero  by  applying  control  to 
change  the  direction  of  its  velocity  vector  {it  is  called  proportions*  navi¬ 
gation  because  the  component  of  interceptor  acceleration  normal  to  its 
velocity  vector  is  made  proportional  to  the  line-of-sight  angular  r~te. 

In  a  recent  paper  [3]  it  was  shown  that  this  scheme  is  optimum  .aider 
three  very  restrictive  conditions,  namely;  (1)  in  the  absence  of  any  noise, 
(2)  having  the  a  priori  knowledge  of  flight  duration,and  (3)  when  the 
performance  criterion  is  taken  to  be  a  quadratic  function  of  the  terminal 
miss  and  the  control  vector  where  the  control  vector  is  assumed  unbounded. 
Although  this  is  an  interesting  result,  yet,  it  does  not  yield  the  solution  _to 
the  practical  problem  cited  above,  in  this  werk  an  optimum  closed  loop 
solution  to  l he  problem  is  derived  where  the  control  is  bounded  in  magni-  , 
tude  and  the  performance  measure  is  chosen  to  be  the  mathematical  expec¬ 
tation  of  the  terminal  miss  distance. 

II.  DERIVATION  OF  EQUATION  OF  MOTION 

For  this  study  the  planar  motion  of  the  target  and  the  interceptor 
wiU  be  considered.  The  solution  to  the  thrae  dimensional  problem  may 
be  obtained  in  a  similar  manner.  Figure  I  displays  the  target-interceptor 
geometry  with  respect  to  a  target  centered  coordinate  system. 
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The  motion  of  the  interceptor  during  the  interval  [  t,  t  +  §0  i  J 
shown  in  Figure  2.  The  relevant  acceleration  vectors  can  be  decomposed 
into  a  component  along  v(t)  and  a  component  normal  to  v(t).  These  com¬ 
ponents  will  be  denoted  by  the  additional  subscripts  v  and  T  respectively. 
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Figure  2 


From  Figure  2  it  is  evident  that 


v  {t  +  fit)  -  e{t)  -  y  (t»  6t)  +  p{t,  fit) 


In  order  to  obtain  an  explicit  expression  for  the  evolution  in  time  of  cr(t). 


the  auxiliary  variables  p{t,  fit)  and  y  (t,  fit)  must  be  eliminated  from  Eq.  (1). 


Consider  first  y{t,  fit)-  Since  j|«j{t)  -  a)(t  *  fit)  jj  “  0(6t)  ,  we  have 


x  I  is  the  Euclidean  norm  of  the  vector  x  in  the  plane. 


lim  0(6t) 


6t~*  0  fit 


=  c  t  .0 


iirh  o(6t)  _ 

St-0  fit 
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III.  DERIVATION  OF  THE  OPTIMAL  CONTROL  LAW 


It  will  be  supposed  in  this  section  that  the  interceptor  can  orient 
its  acceleration  vector  in  any  arbitrary  direction.  Since  we  will  mechanize 
this  system  with  feedback,  the  observable  data  must  be  specified.  Assume 
that  the  directional  information  available  on  board  consists  of  the  line-of- 
sight  direction  and  the  interceptor  velocity  and  acceleration  vectors  (these 
latter  may  be  obtained  by  using  a  set  of  accelerometers  and  integrators  cr 
by  velocity  measurements  with  respect  to  an  inertial  reference  system 
fixed  in  space.  ) 

To  be  more  precise,  denote  the  observed  portion  of  the  vehicle 
dyuamic  characteristics  by  §(t).  Since  it  is  not  reasonable  to  presume 
that  such  time  functions  as  line-of-sight  angle  can  be  measured  without 
error,  an  additive  noise  vector  n(t)  will  be  added  to  the  observations  to 
obtain  the  vector  7](t),  the  feedback  signal  to  the  controller. 

H  (t)  S(t)+  n(t)  .  (6) 

For  convenience  in  notation  denote  the  time  function  T*  (t  >;  0  <  T  <  t  by 

*ntcE). 

To  obtain  an  explicit  expression  for  the  optimum  control  rule,  we 
will  employ  the  dynamic  programming  formalism.  Let  J  denote  the  magni¬ 
tude  of  the  terminal  miss  distance  if  the  optimal  control  is  used.  By 
terminal  miss  distance  we  mean  the  euclidean  distance  between  target 


and  interceptor  at  the  end  of  the  pursuit  process.  The  scalar  J  is 


non-negative  and  is  sero  only  if  the  interceptor  hits  the  target.  If  the 
system  state  is  given  at  time  t,  the  terminal  miss  will  be  a  random 
variable  because  of  the  dependence  of  vehicle  acceleration  on  observation 
noise.  The  state  of  the  interceptor  may  be  described  in  terms  of  the  line- 
of-sight  angle,  the  range,  and  the  velocity.  If  the  expected  value  of  the  " 
miss  distance  is  used  as  the  performance  measure,  we  have 


H  =  ElJ(a(t),  V(t),  R(t»  iT|c(I)j2 


(7) 


This  is  a  terminal  control  problem,  and  consequently,  the  incre¬ 
mental  cost  is  identically  aero.  From  Bellman’s  principle  or  optimality 
[4]  we  obtain  the  following  recurrence  formula 

'  t 


H  =  E{J(o(t),  V(t),  R(t))|T]t(I)} 


=  E  j  min  E{  ,T(a(t+A),V(t+A),R{t+fl)|T)  a)}.|Tlta 

a(T) 

t  <  T<t+A 


Since  a(T),  t  <  T  <  t  +  A  is  functionally  dependent  oh  we  may  use  the 

identity 


E  {  E  { A  |  B}}  =  E{A} 


To  simplify  the  notation,  capital  letters  will  be  used  to  denote  the  magni¬ 
tude  of  the  velocity,  and  range  vectors. 
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j  to  obtain 


E{J(a(t),V(t),R(t)h  (I)}  =  min  E{J(a(fc  f  A),  V(t+  A),  R{t+A)  |7HI)}  • 

a(T)  1 

t<  T  <t  +  A 


If  J  possesses  all  necessary  derivatives, 


J(o(t+A),  V(t+A),.R(t+A))  =  J(ff(t),V(t)  E(t)} 

,  3J  <  do  a  J  dV  .  2  J  dR 

dcr(t)  dt  3V(t)  dt  5R(t)  dt 


A  +  o{A) 


(9) 


Therefore, 


n-  min  rf  3J  AfM  i  aj  dV(t)  i  5J  dR{t)  i  °tA)jn  ml 
0-  mm  El^(t}°W  +  av(tj  dt  +dR(t)  dt  A  !VlJ’ 

aVTf  -  - 

t  <  T  <  t+  A  v 


Clearly  the  rate  of  change  of  R(t)  depends  only  on  the  velocity  vector 
v(t),  and  is  thus  independent  of  the  minimization  with1  respect  to  a(t). 
Note  that 


U0) 


dV(t) 

dt 


da 


dt 


ajy(^  +  anpy(*-) 


V(t)  .  .  alNW  4  “TN1'1 

m ■“■°w  +  — vw — • 


Consequently,  we  must  seek  the  minimum  of 
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■'  l  9®  jjRftT  *“  olt)  * 


aINi£>  +  “tN*1’ 


d  J  !  , 

+ - -  la  +  a. 

av  iv  tv 


\a>} 


Only  a  may  be  chosen  at  our  discretion;  that  is  we  must  consider 


jjU  aiN(t)  .  JU_ 
t  a  a*  V(tf  r  3  V  aiv 


I  ■•>,(»} 


subject  to  the  constraint  tnat  the  interceptor  is  bounded 


V’2  +  hv  ^  c2 


The  interceptor  acceleration  is  functionally  dependent  upon  T| ^ (I ) ,  and 
may  therefore  be  factored  out  of  the  conditional  expectation.  The  optimal 
control  lies  either  on  the  boundary  of  the  constraint  set  or  it  lies  in  the 
interior  In  the  former  case  the  Lagrange  multiplier  technique  may  be 
used  to  evaluate  a  (t).  We  must  minimize 


•;i  — -i-‘ V”]  +  hvKlv'V1’} 


aiN  l  9o  \(t) 


X  f  l  .  .  .  2 

2  yaIN  (t)  +  aiv 


subject  to  the  magnitude  constraint.  The  acceleration  vector  must  then 


satisfy  the  equations 
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"  "  j'Sni  ''t  *,«  -  - 


l*w«  +  E{H-vkil'ta)}  =  ° 


X  aIV(t)+  E{|i-  |  11 1»).}  -  0 


From  the  constraint  equation 


E{w'V>}2  *  E{  v7ti  &  K”}* 


=  C 


Consequently, 


C  E{  3o  V{t)  ^  11  t(I)} 


IN 


1/2 


E{av  ^t^} 


TV  r- 


,“1  i/2 


E{vV)  If  i  V’} +  E { w>  'V’} 


The  optimal  control  will  lie  on  the  interior  of  the  constraint  set 


only  if 


e(tt-  i  V>V  e{I4  l\«}  =0 


3J 
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It  seems  reasonable  to  suppose  that  the  set  of  t  e  [0,T]  satisfying 
Eq.  (13)  is  finite  or  at  least  countable,  and  for  this  reason  such  points 
will  be  neglected  in  this  investigation. 

IV.  OPTIMAL  CONTROL  LAW  WITH  DIRECTION  CONSTRAINT 

Equation  (12)  gives  an  explicit  expression  for  the  optimal  control 
law.  Two  major  drawbacks  to  actual  implementation  of  this  controller 
become  immediately  evident.  On  the  one  hand,  the  partial  differential 
equation  for  the  conditional  expectation  for  J  (see  Eq.  (10))must  be  solved. 
Because  of  the  bound  on  the  control,  this  is  a  far  from  trivial  task. 

Secondly,  the  interceptor  must  have  some  means  of  sensing  the  direction 
of  its  velocity  vector.  For  some  vehicles  this  would  involve  an  unwarranted- 
increase  in  controller  complexity. 

One  way  to  obviate  these  difficulties  is  to  fix  the  direction  of  the 
interceptor  acceleration  with  respect  to  a  measured  direction.  Since  the- 
direction  of  the  line-of-sight  is  readily  available,  let  us  suppose  that  the 
acceleration  vector  a(t)  is  orthogonal  to  the  line-of-sight  vector.  Clearly, 
this  additional  restriction  on  the  acceleration  can  only  increase  the  per¬ 
formance  measure  studied  in  the  previous  section.  The  justification  of 
the  restriction  lies  in  the  reduced  system  complexity  obtainable. 

With  the  direction  constraint 
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a  (t)  =  a(t)coso/|t) 


aIV(t)  =  a(t)  sin  a  (fc) 


I  a(t)  |  <  c 


We  must  choose  a(t)  to  minimize 


E  {  t*  a(t)  c-|(?r  +  aw a(t>  sin  0(0 1  v* } 


Since  a(t)  is  functionally  dependent  on  'Ti^(I),  the  optimal  control  is 


a(t|  =  -c  sgn  E  (  ~~  sin  o  (t)] 


yl  &is  vslue  of  a(t)  is  placed  in  Eq.  (10),  a  partial  differential 
equation  for  the  conditional. expectation  of  J  would  result.  Solving  this 
equation  would  represent  quite  a  task  and  no  attempt  will  be  made  to 
solve  the  equation  explicitly  here.  Two  intuitively  evident  conjectures 
about  properties  of  J,  however,  do  permit  us  to  obtain  the  optimal  control 
rule.  Recalling  that  J  is  a  non-negative  function  for  all  values  of  its 
arguments,  we  might  suppose  that  J(a(t),  V(t),  R(t))  is  an  increasing 
function  of  o  (t)  for  all  positive  V(t)  and  R(t)  if  ^  >  a(t)  >  0  and  adecreasing 
function  of  o  (t)  for  all  positive  V(t)  and  R(t)  if  <  o{t)  <  0.  Then 


*  "  -  *'5 


sgn  o  (t)  . 


a  s 

'6  0  (t) 


Furthermore,  V(t)  is  always  a  positive  quantity  and  Eq.  (5) 

indicates  .hat  an  increase  in  V(t)  has  the  effect  of  reducing  the  influence 
o.l  the  i.'  erceptov  acceleration.  Consequently,  it  seems  reasonable  to 

suppose  that; 


a  3 

ov;t) 


>  o 


for  all  positive  R  (t)  and  all  a  It). 


From  the  symmetry  of  the  problem 


3  (a It),  V(t),  R(t);  =  J(-o(t),  V(t),  R(t))  . 


Therefore,  we  have  the  result  that 


oJ  cos  o  (i  i 
9  o  (t)  V(t) 


+ 


9? 


sin  a(t) 


is  an  odd  function  of  a(t).  In  fact,  if  ti  e  conditional  probability  distribution 
of  o  (t)  is  symmetric,  and  if  |<j(t)  j  <  tt/2,  _  _ 


sgn 


6J 
6  o(t) 


cos  a  (t)  ,  dJ 
V(t)  3  V(tjT 


Bin  cr(t)  |Tlt(I)| 


sgnE{a(t)|yi)]  -  ^  (15) 
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•j.i  tilting  tbits  i'i-yuii  into  Eq.  {1-i) 

a(t;  -  -  c  Sfin  e{  0  «t)  !  T)  t(D~}  -  (16) 

V.  CHOICE  OF  REQUIRED  MEASUREMENTS 

Equation  (16)  pi  ovides  the  optimal  control  rule  and  for  its  imple- 
t  aenin.  Li  or  we  have  to  determine  sgn  E{  C  (t)  |1)  (I)  I  .  However,  a  (t)  cannot 
he  measured  directly  since  (see  Figure  1)  v^,(t)  is  an  unknown  quantity  to 

the  interceptor.  From  Figure  2 

f  (t  +  at)  =  Y(t)  -  y<t.  6t)  (IV) 

Replacing  Y  (t,  fct)  by  it3  vaiue  in  Eq.  (2),  directing  both  sides  by  6(t)  and 
taking  the  limit  as  t  "0  we  have 


no 


Hu  (OB 

I'WI 


sin  a  (t) 


(18) 


v  (t)  |j  and  jjr(t)  ||  are  positive  quantities  and  !  0(t)  j  <  tt/2  then 


sgn  o  (t)  =  sgn  sin  a  (t)  =  -sgn  T(t) 


(19) 


Consequently,  the  quantity  which  should  be  measured  is  the  line  of  sight 
rotation  with  respect  to  an  inertial  coordinate  system.  This  quantity 
can  be  measured  on-board  by  means  of  a  tracking  system.  Finally,  the 
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T)(t)  ‘  HO  +  n{t)  , 

where  n(c)  is  the  measurement  noise.  When  statistics  of  ¥{t)  and  n{t» 
arc  known,  the  estimation  oi  E^(t)  |  T|  (I)jcan  be  carried  out  by  appropriate 
opiimal  filtering  [  5]  . 

A  different  procedure  in  addition  tc,  on*,  discussed  above  can  be 
used  in  the  special  case  when  the  target  is  stationary.  In  this  case 
v(t)  =  Vjtt).  The  angle  between  v^.(t)  and  an  axis  fixed  with  respect  tc 
the  vehicle  body  can  be  measured  by  means  of  appropriate  instruments 
on  board.  Furtherm  ore,  the  angle  between.  r(t)  and  the  body  fixed  axis 
can  be  measured  by  means  of  an  antenna  System  fixed  with  respect  to 
the  vehicle.  The  desired  angle  a  (t)  is  the  difference  between  these  two 
angles  The  advantage  of  this  approach  is  that  it  is  not  necessary  to 
mount  the  tracking  antenna  on  a  stable  platform. 
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VI.  ADDITIONAL  SYSTEM  DYNAMICS 


In  practice  there  may  be  a  delay  time  of  d  seconds  between  a 
value  of  T(t)  and  generation  of  the  appropriate  acceleration  given  by  (20), 

The  delay  time  d  may  be  actually  an  approximation  to  various  system 
dynamics  involved  such  as  the  tracking  loop  dynamics,  reaction  jets,  etc. 

It  is  easily  seen  that  under  this  condition  the  optimum  control  rule  given 
by  (20)  should  be  modified  to 

A(t)  =  +c  sgn  E{Y(t)iT)t  da)} 

where  1}  .  (I'<  represents  the  time  function  1)  (t  );  0  <  r  <  t-d. 

t-d  —  — 

• 

In  other  words  the  control  is  based  on  the  estimate  of  Y(t)  corres¬ 
ponding  to  the  data  received  over  the  interval  0  <  T  <  t-d. 

VII.  CONCLUSIONS 

The  optimum  control  l^w  for  a  vehicle  which  seeks  to  reach  a 
moving  target  has  been  derived  where  the  performance  criterion  is  the 
expectation  of  the  terminal  miss  distance  and  the  thrust  vector  is  bounded 
'.n  magnitude.  J.t  is  shown  that  the  optimum  control  law  is  a  function  of  the  sign  of  the 
expected  value  of  the  line-of-sight  angular  rate  with  respect  to  an  inertial 
coordinate  sy3tem. 
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